A constructive proof of Simpson's Rule 

o: 

^ ; Thierry Coquand 

X> ■ Bas Spitters 

in 



< 



Abstract: For most purposes, one can replace the use of Rolle's theorem and the 
mean value theorem, which are not constructively valid, by the law of bounded 
change [3]. The proof of two basic results in numerical analysis, the error term 
for Lagrange interpolation and Simpson's rule, however seem to require the full 

■ strength of the classical Rolle's Theorem. The goal of this note is to justify these 
^ (-h two results constructively, using ideas going back to Ampere [1] and Genocchi [6], 
-i— > 
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■ 1 Introduction 

cn 

q ■ Rolle's Theorem states that iff is differentiable on a real interval [a, b] with a < b and 

■ f(a) =f(b) = then there exists c in ]a, b[ such that /'(c) = 0. It implies directly the 
mean value theorem that iff is differentiable on a real interval [a, b] with a < b, then 
there exists c in ]a, b[ such that/(Z?) —/(a) = (b — a)f'(c). This is a key result in most 

^ 1 text books in Analysis. It is rather direct to see that it does not hold constructively [2], 

^ ■ and is replaced in this context by an approximated form: if / is differentiable on a 

real interval [a,b] with a < b then for any e > there exists c in ]a,b[ such that 
\f(b) — f(a) — (b — a)f'{c)\ < e. This more complex formulation can be thought to be 
a problem of constructive mathematics. 

It can be argued however [3] that most applications of the mean value theorem can be 
replaced by the law of bounded change that we have \f(b) — f{a)\ ^ M(b — a) iff is 
uniformly derivable on [a, b] and \f'(x)\ ^ M for all x in [a, b] . The law of bounded 
change is constructively valid and a presentation based on the law of bounded change [3 ] 
appears as elegant as the classical treatment of basic analysis results. Interestingly, a 
criticism of the mean value theorem, which could have been written by a constructive 
mathematician, appears in Dieudonne Foundations of Modern Analysis [4] : "the trouble 
with that classical formulation is that ... it completely conceals the fact that nothing 
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is known on the number c, except that it lies between a and b, and for most puiposes, 
all one needs to know is that /'(c) is a number which lies between that g.l.b. and l.u.b. 
of/' in the interval [a,b] (and not the fact that it actually is a value of/')." The goal 
of this note is to analyse two results that at first seem to require the full strength of 
the classical mean value theorem. The second of these results is Simpson's rule for 
approximating an integral, which is indeed proved in the exercises of [4] using Rolle's 
Theorem. We show, using some ideas going back to Ampere [1] and Genocchi [6], 
how to justify these results constructively. 

This note is organised as follows. We first present the two results we want to analyse: 
Lagrange error formula and Simpson's rule. We then explain Genocchi's formula, in a 
way which stresses the connection with the work of Bridger and Stolzenberg [3], and 
show how it can be used instead of Rolle's Theorem. 



2 Lagrange error formula and Simpson's rule 

In this section, we present two basic results, Theorems 2.2 and 2.3, and explain how 
they can be classically derived using the following generalization of Rolle's Theorem. 

Theorem 2.1 (Classical generalized Rolle's theorem) Let/ be n times differentiable 
and have n + 1 zeroes in an interval [a, b] . Then f {n) has a zero in [a, b] . 

Proof If / has n + 1 zeroes xo < x\ < ■ ■ ■ < x„ then using Rolle's Theorem /' 
will have n zeroes y®, . . . ,y n -i with yi in Hence we obtain the result by 

induction on n . □ 

We now present the Lagrange polynomial as it can be found in numerical analysis 
textbooks; e.g. [5]. 

Theorem 2.2 (Lagrange error formula) Letf be n times differentable on an interval 
[a, b] , P the polynomial of degree n — 1 which agrees withf on n values xq < • • • < 
x n ^\ and M such that \f^"\x)\ ^ M for all x in [a, b] . Then for all x in [a, b] 

\f(x)-P(x)\^ miX ~ Xk)l M 
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Proof This is proved using Rolle's Theorem in the following way [4, 5]. First, 
classically, we can assume that x is not equal to one of the x,- since the inequality is 
clear if x = x\ . We then consider the function 

g(y) = ; — (TOO - P(y)) ; — (fix) - Pi*)) 

This function is n times differentiable and has n + 1 zeroes x, xo, . . . ,x n _i . Using 
Theorem 2.1, there exists c such that g( n '(c) = which can be written 

f{x) - P(x) = n( *7**V w) (c) 
hence the result. □ 

By a similar use of Theorem 2.1, one can derive the following classical result [4, 5]. 

Theorem 2.3 (Simpson's rule [5]) Iff is 4 -differentiable on an interval [a,b] and 
\f (4 \x)\ ^ M for all x in [a,b], then we have 



b 



fix) dx 



+ 4/1 a -^~ ) +fib) 



5 



^ib-a) 
2880 



Rolle's theorem is constructively provable [2] provided /' is locally non-constant: In 
every interval there are x,y such that /'(;*:) <f'(y). We remark that when/' is locally 
nonzero, i.e. in every interval there is an x such that/'(x) > or/'(x) < 0, then / is 
locally non-constant, as is readily seen by integration. It follows that if f^ is locally 
nonzero, then for all k < n, f^ is locally nonconstant. We obtain an equal conclusion 
version of the generalized Rolle's theorem: Let / be n times differentiable and have 
n + 1 zeroes in an interval [a,b]. If, moreover, / (,l) is locally nonzero, then/^ has a 
zero in [a,b]. 

From this equal conclusion version, we can obtain an equal hypothesis version of 
Rolle's theorem. 

Proposition 2.4 Let f be n times differentiable and have n + 1 zeroes, x,-, in an 
interval [a,b]. Then there exists x G [a,b] such that [/' ( -" ) (x)| < e. 

Proof Either there exists x € ia,b) such that [/^(x)| < e or for all x € ia,b), 
\f( n \x)\ > e/2. In the former case, we are done. In the latter case, f("' is locally 
non-zero, hence by the remark above, we can follow the proof of Theorem 2.1 to 
conclude that/^'^ has a zero. A contradiction. □ 



4 



Thierry Coquand and Bas Spitters 



In this way we can derive the Lagrange error formula 2.2. One can argue however that 
this derivation is more complex than the classical result. On page 7 we give a smooth 
proof of Theorems 2.2 and 2.3, which holds both classically and constructively, and 
does not rely on Theorem 2.1. 



3 Hermite-Genocchi formula 



The definition in [3] of uniform differentiability of a function / on an interval / = [a, b] 
can be formulated as follows: there exists an uniformly continuous function F : I 2 — > R 
such that f(y) — f{x) = (y — x)F(x,y) for all x,y in /. We can then define f'(x) to 
be F(x,x). The following result is a generalization of this characterisation to n- 
differentiability. The proof is a simple application of the fundamental theorem of the 
calculus [3]. 

Theorem 3.1 A function /:/—>■ R is uniformly n-differentiable if, and only if, 
there exist n+l uniformly continuous functions fo(x), fi(xo,xi), . . . , f n (xo, ■ ■ ■ ,x n ) 
defined respectively on 1,1, . . . , I n+l and such that 

fo(x) =f(x), f (xi) -fo(x ) = (x x - x Q )fi{x Q ,x{), 

fn-li^O, ■ ■ ■ ,X„_2,X„) — f„_i(xo, . . . ,X n _2,Xn-l) = (x n ~ X n -l)fn(Xo, ■ ■ ■ ,X„) 

for all xq, . . . ,x„ in I. 
We then have 

f"\x) = n\f n (x,...,x) 

and, conversely, we can define 

(*) Mxo, ■ ■ ■ ,x n ) = / f {n \toXQ H h t n x n )dto . . . dt n 

where 1 E„ = {(f , ...,/„)€ [0, l] n+1 | to + ■ ■ ■ + t n = 1}. 

'This integral of a uniformly continuous function g over £„ can be defined by induction on 
n in the following way: for n = the integral is and for n > we define 

h(t , ...,/„_i) = / g(t , ...,t n -\u, f„_i(l - u))du 
Jo 

and the integral of g over £„ is the integral of h over S„_i . 
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Proof Assume that the divided differences are uniformly continuous on I, I 2 , . . . , I n+l . 
Since 

f(y)-f(x) = (y-x)Mx,y) 

we get that / is uniformly differentiable and f'(x) = f\ (x, x) . We then have 

f'(y)-f'(x) = fi(y,y)-Mx,x) 

= f\(y,y) -fi(y,x) +My,x) -fi(x,x) 
= (y - x)(fz(y,x, y) +fz(x, x, y)) 

and so/ is uniformly 2 -differentiable and/ (2) (.x) = 2f2(x,x,x). Proceeding in this way 
we see that/ is uniformly & -differentiable and that we have klfkiy, . . . ,y) = f { - k \y) for 
k = 0, . . . , n. 

Conversely, assume that / is uniformly n -differentiable. We define 

fk(xo, ...,Xk)= / f (k) (t Q xo H h t k x k )dt . . . dt k 

for k = 0, . . . , n. These functions are uniformly continuous. Furthermore, we have 
/o<X> =f(x) and 

fo(x) ~fo(xo) = (x- x Q )fi(x ,x) 

since 

(x - x )f l (x,x ) = / /'((l - t)x Q + tx)dt 
Jo 

holds for x apart from xq , by the fundamental theorem of the calculus [3] and hence 
for all x, xo by continuity. It follows that we have 

fk(XQ, ■ ■ ■ ,Xk-\,X) —fk(X0, ■ ■ ■ ,Xk-l,Xk) 

= / (f (k ~ l \toXo + --- + tk-ix)-f (k - i \tox Q + --- + t k „iXk))dto...dt k - l 
= (x- x k ) \ f (k) (t xo H h t k -ix k -i + t k x)dto ...dt k 

JT, k 

= (x- x k )fk+i(xo, ...,x k ,x) 
which shows that these functions satisfy the required equations. □ 

Notice that the functions f k (xo, . . . ,Xk) are symmetric in xo, ■ ■ ■ ,Xk and that we can 
then write f(x) as the sum 

Mxo) + (X- X )fi(XQ,Xi) H h (X - Xo) ■ ■ ■ (X - X^iYniXQ, ...,x n -i,x) 
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The equation (*) of Theorem 3. 1 known as Hermite-Genocchi formula. Genocchi found 
these formulae by analysing the notion of "fonctions interpolaires" due to Ampere [l] 2 . 

A direct consequence of Theorem 3.1 is the following result. 

Corollary 3.2 Gi ven a function f ; / — y IR and n + 1 distinct elements xq, . . . ,x n 
in I let P(xq, . . . ,x n ,x) be the interpolation polynomial a n ^i(xQ, . . . ,x n )x n ~ l + • • • + 
ao(xo, ■ ■ ■ ,x n ) of f at xq,. . . ,x n . Then P(xq, . . . ,x n ,x), seen as a function of the 
parameters xo, . . . ,x n , can be extended to an uniformly continuous function on I n+l 
(that is, each function dj(xo, ■ ■ ■ ,x n ) can be extended to an uniformly continuous 
function on I n+l ) if and only if f is uniformly n-differentiable. 

4 Applications 

We explain now how to derive Theorem 2.2 from Theorem 3.1. We assume that 
/ : / — > IR is uniformly Ti-differentiable. Given any n elements xq, . . . ,x n -\ in / we 
associate the Newton polynomial of degree n — 1 : 

P{x) :=/o(*o)+/i(xo,*i)(*-*o)H \-f n -i(xo, ...,x„-i)(x-xq)(x-xi) ■ ■ ■ (x-x n _ 2 )- 

We have 

f{x) - P(x) = (X- Xo) . . . (X - X n -l)f n (XQ, X„_l, X) 

On the other hand, we also have P(xj) = f(x{) for i = 0, . . . , n — 1 . Theorem 2.2 
follows then from the fact, direct from Theorem 3. 1 , that 

, M 

\fn(X0, ■ ■ ■ ,X„-1,X)\ < — 

n\ 

if we have \f {n \u)\ ^ M for all u m I . 

Notice that the Newton polynomial P(x) is denned for any xo, . . . ,x n -\ without re- 
quiring them to be distinct. We have P^ \xi) =f^ \xi) if X; is duplicated / times 
in this list. 

Theorems 3.1 is also valid for / : / — > E, where £ is a Banach space. Theorem 2.2 is 
also valid for / : / — > F, where F is a normed space. 

2 In the case where/ is a monic polynomial, the functions /(xo,jci), . . . , f n (xo, . . . ,x n -i) 
are also polynomial and they form with / a Grobner basis of the universal decomposition 
algebra of the polynomial/; see e.g. [7]. 
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4.1 A constructive proof of Simpson's rule 

As explained above, the typical proofs of Simpson's rule 2.3, see e.g. [5, 8], use Rolle's 
theorem, and so are not constructively valid. We adapt the proof in [8] which uses 
Rolle's theorem three times on triple zero at and the simple zero at 1 of the function 
H below. 

Define F(t) :=f( s ^- + ^t). This reduces the problem to showing that 

| J F{t)oIt - -(F(-l) + 4F(0) + F(l))| < 

where N := ||F (4) ||. 

Define ^ 

G{t) = J F(T)dT-^(F(-t) + 4F(0)+F(t)) 

We need to prove that 90G(1) < N. To do so, define H(t) := G(t) - t 5 G{l). Then 

H(0) = H(l) = H'(0) = H"(0) = 0. 

Hence, H 3 (0, 0,0, 1) = -(H 2 (0,0,0) -H 2 (0,0, 1) = + (-#i(0,0) + H l (0, 1)) = 0. 
(This line replaces three uses of Rolle's theorem.) 

Moreover, 

t t f' 

H®\t) = --(F (3 \t) - F°\-t)) - 60? 2 G(1) = --( / F (4) ) - 60f 2 G(l). 
3 3 J—t 

This shows that 

= #(0,0,0,1) = / H (3) 



1 t 







-( / F w ) - 60? Z G(1) 



-^-2tN-60f 2 G(l) 



o 
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= -\{N + 9QG{\)) / t 2 

~> JO 

= -^(AT + 90G(l))i 

Hence, N > -90G(1). Similarly, ^ + 90G(1)). Consequently, 90G(1) 

N. We conclude that |90G(1)| N. 

A similar argument works to justify e.g. Romberg's integration method [8] which 
generalizes Simpson rule. 
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